Ensemble Methods – Classifier Combination in Machine Learning
نویسنده
چکیده
The last ten years have seen a research explosion in machine learning. The rapid growing is largely driven by the following two forces. First, separate research communities in symbolic machine learning, computational learning theory, neural network, statistics and pattern recognition have discovered one another and begun to work together. Second, machine learning technologies are being applied to new kinds of problems as well as in more traditional problems such as speech recognition, face detection and recognition, handwriting recognition, medical data analysis, biometrics, data mining and game playing etc. [Jain00]. Dietterich summarized four of these directions in his review paper [Dietterich97], one of them is ensembles of classifiers, the other three are scaling-up supervised learning, reinforcement learning and learning stochastic models. Ensemble methods are learning algorithms that construct a set of classifiers and then classify new data points by taking a (weighted) vote of their predictions [Dietterich00]. It has been applied to a wide range of real problems, such as object detection and recognition. As a multiple learner system, it tries to exploit the local different behavior of the base learners to enhance the accuracy and the reliability of the overall inductive learning system. There are also hopes that if some learner fails, the overall system can recover the error. Basically the effectiveness of ensemble methods relies on the independence of the error committed by the component base learner. From a general standpoint we know that the effectiveness of ensemble methods depends on the accuracy and the diversity of the base learners. In fact, there is a trade-off between accuracy and independence: more accurate are the base learners, less independent they are. Technically ensembles can enlarge the effective hypotheses coverage, expanding the space of representable functions.
منابع مشابه
Fault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods
Anti-Friction Bearing (AFB) is a very important machine component and its unscheduled failure leads to cause of malfunction in wide range of rotating machinery which results in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...
متن کاملApplication of ensemble learning techniques to model the atmospheric concentration of SO2
In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...
متن کاملCombining Classifier Guided by Semi-Supervision
The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...
متن کاملCombining Classifier Guided by Semi-Supervision
The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...
متن کاملImproving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...
متن کاملEmpirical Evaluation of Ensemble Techniques for a Pittsburgh Learning Classifier System
Ensemble techniques have proved to be very useful to boost the performance of several types of machine learning methods. In this paper, we illustrate its usefulness in combination with GAssist, a Pittsburgh-style Learning Classifier System. Two types of ensemble are tested. First baggingstyle consensus prediction. Second an ensemble intended to deal more efficiently with ordinal classification ...
متن کامل